Native Language Identification using Phonetic Algorithms
نویسندگان
چکیده
In this paper, we discuss the results of the IUCL system in the NLI Shared Task 2017. For our system, we explore a variety of phonetic algorithms to generate features for Native Language Identification. These features are contrasted with one of the most successful type of features in NLI, character n-grams. We find that although phonetic features do not perform as well as character n-grams alone, they do increase overall F1 score when used together with character n-grams.
منابع مشابه
مقایسه روش های طیفی برای شناسایی زبان گفتاری
Identifying spoken language automatically is to identify a language from the speech signal. Language identification systems can be divided into two categories, spectral-based methods and phonetic-based methods. In the former, short-time characteristics of speech spectrum are extracted as a multi-dimensional vector. The statistical model of these features is then obtained for each language. The ...
متن کاملVocal Pathologies Detection and Mispronounced Phonemes Identification: Case of Arabic Continuous Speech
We propose in this work a novel acoustic phonetic study for Arabic people suffering from language disabilities and non-native learners of Arabic language to classify Arabic continuous speech to pathological or healthy and to identify phonemes that pose pronunciation problems (case of pathological speeches). The main idea can be summarized in comparing between the phonetic model reference to Ara...
متن کاملSpeech Workshop: New Perspectives on Developmental Models
In this model, speech perception by adults is characterized as an active, information-seeking process whereby native listeners detect the most reliable acoustic parameters that specify phonetic segments and sequences, using highly over-learned, automatic selective perception routines. In laboratory perceptual tasks, differentiation of native L1 phonetic contrasts is rapid and robust in suboptim...
متن کاملAutomatic accentedness evaluation of non-native speech using phonetic and sub-phonetic posterior probabilities
Automatic evaluation of non-native speech accentedness has potential implications for not only language learning and accent identification systems but also for speaker and speech recognition systems. From the perspective of speech production, the two primary factors influencing the accentedness are the phonetic and prosodic structure. In this paper, we propose an approach for automatic accented...
متن کاملA generic methodology of converting transliterated text to phonetic strings case study: greeklish
In this work, we present a generic methodology for converting transliterated text (native language written with a non-native alphabet) to phonetic sequences. The goal is to create the same phonetic result that would be produced if a native speaker uttered the original text in native alphabet. In our work, we implemented the specific methodology as a front-end to a Text-to-Speech (TTS) server. T...
متن کامل